Results 1 - 20 of 45
1.
Sci Rep ; 14(1): 8708, 2024 04 15.
Article in English | MEDLINE | ID: mdl-38622173

ABSTRACT

Recent work has revealed an important role for rare, incompletely penetrant inherited coding variants in neurodevelopmental disorders (NDDs). Additionally, we have previously shown that common variants contribute to risk for rare NDDs. Here, we investigate whether common variants exert their effects by modifying gene expression, using multi-cis-expression quantitative trait loci (cis-eQTL) prediction models. We first performed a transcriptome-wide association study for NDDs using 6987 probands from the Deciphering Developmental Disorders (DDD) study and 9720 controls, and found one gene, RAB2A, that passed multiple testing correction (p = 6.7 × 10⁻⁷). We then investigated whether cis-eQTLs modify the penetrance of putatively damaging, rare coding variants inherited by NDD probands from their unaffected parents in a set of 1700 trios. We found no evidence that unaffected parents transmitting putatively damaging coding variants had higher genetically-predicted expression of the variant-harboring gene than their child. In probands carrying putatively damaging variants in constrained genes, the genetically-predicted expression of these genes in blood was lower than in controls (p = 2.7 × 10⁻³). However, results for proband-control comparisons were inconsistent across different sets of genes, variant filters and tissues. We find limited evidence that common cis-eQTLs modify penetrance of rare coding variants in a large cohort of NDD probands.


Subject(s)
Neurodevelopmental Disorders , Polymorphism, Single Nucleotide , Child , Humans , Penetrance , Quantitative Trait Loci/genetics , Neurodevelopmental Disorders/genetics , Transcriptome
2.
Nat Genet ; 56(2): 222-233, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38177345

ABSTRACT

Most genome-wide association studies (GWAS) of major depression (MD) have been conducted in samples of European ancestry. Here we report a multi-ancestry GWAS of MD, adding data from 21 cohorts with 88,316 MD cases and 902,757 controls to previously reported data. This analysis used a range of measures to define MD and included samples of African (36% of effective sample size), East Asian (26%) and South Asian (6%) ancestry and Hispanic/Latin American participants (32%). The multi-ancestry GWAS identified 53 significantly associated novel loci. For loci from GWAS in European ancestry samples, fewer than expected were transferable to other ancestry groups. Fine mapping benefited from additional sample diversity. A transcriptome-wide association study identified 205 significantly associated novel genes. These findings suggest that, for MD, increasing ancestral and global diversity in genetic studies may be particularly important to ensure discovery of core genes and inform about transferability of findings.


Subject(s)
Depressive Disorder, Major , Genome-Wide Association Study , Humans , Genetic Predisposition to Disease , Depressive Disorder, Major/genetics , Depression , Chromosome Mapping , Polymorphism, Single Nucleotide/genetics
3.
Cell ; 186(21): 4514-4527.e14, 2023 10 12.
Article in English | MEDLINE | ID: mdl-37757828

ABSTRACT

Autozygosity is associated with rare Mendelian disorders and clinically relevant quantitative traits. We investigated associations between the fraction of the genome in runs of homozygosity (FROH) and common diseases in Genes & Health (n = 23,978 British South Asians), UK Biobank (n = 397,184), and 23andMe. We show that restricting analysis to offspring of first cousins is an effective way of reducing confounding due to social/environmental correlates of FROH. Within this group in G&H+UK Biobank, we found experiment-wide significant associations between FROH and twelve common diseases. We replicated associations with type 2 diabetes (T2D) and post-traumatic stress disorder via within-sibling analysis in 23andMe (median n = 480,282). We estimated that autozygosity due to consanguinity accounts for 5%-18% of T2D cases among British Pakistanis. Our work highlights the possibility of widespread non-additive genetic effects on common diseases and has important implications for global populations with high rates of consanguinity.
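As a rough illustration of the FROH measure named in this abstract, the sketch below sums the lengths of runs of homozygosity (ROH) and divides by the length of the genome considered. The function name, the genome length constant and the minimum segment length are assumptions chosen for illustration; the abstract does not state how the study called ROH or which thresholds it used.

```python
from typing import Iterable, Tuple

# Illustrative autosomal length in base pairs (an assumption, not a value from the study).
AUTOSOME_LENGTH_BP = 2_880_000_000


def froh(
    segments: Iterable[Tuple[int, int]],
    genome_length_bp: int = AUTOSOME_LENGTH_BP,
    min_length_bp: int = 1_000_000,
) -> float:
    """Fraction of the genome covered by ROH segments.

    segments: (start, end) coordinates of ROH calls, assumed non-overlapping.
    min_length_bp: hypothetical cutoff; shorter segments are ignored.
    """
    total = sum(
        end - start
        for start, end in segments
        if end - start >= min_length_bp
    )
    return total / genome_length_bp
```

For example, under these assumptions `froh([(0, 2_000_000)], genome_length_bp=100_000_000)` counts one 2 Mb segment against a 100 Mb toy genome.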


Subject(s)
Consanguinity , Diabetes Mellitus, Type 2 , Humans , Diabetes Mellitus, Type 2/genetics , Homozygote , Phenotype , Polymorphism, Single Nucleotide , Biological Specimen Banks , Genome, Human , Genetic Predisposition to Disease , United Kingdom
4.
Nat Genet ; 55(9): 1483-1493, 2023 09.
Article in English | MEDLINE | ID: mdl-37592024

ABSTRACT

Our understanding of the genetics of the human cerebral cortex is limited both in terms of the diversity and the anatomical granularity of brain structural phenotypes. Here we conducted a genome-wide association meta-analysis of 13 structural and diffusion magnetic resonance imaging-derived cortical phenotypes, measured globally and at 180 bilaterally averaged regions in 36,663 individuals and identified 4,349 experiment-wide significant loci. These phenotypes include cortical thickness, surface area, gray matter volume, measures of folding, neurite density and water diffusion. We identified four genetic latent structures and causal relationships between surface area and some measures of cortical folding. These latent structures partly relate to different underlying gene expression trajectories during development and are enriched for different cell types. We also identified differential enrichment for neurodevelopmental and constrained genes and demonstrate that common genetic variants associated with cortical expansion are associated with cephalic disorders. Finally, we identified complex interphenotype and inter-regional genetic relationships among the 13 phenotypes, reflecting the developmental differences among them. Together, these analyses identify distinct genetic organizational principles of the cortex and their correlates with neurodevelopment.


Subject(s)
Cerebral Cortex , Genome-Wide Association Study , Humans , Cerebral Cortex/diagnostic imaging , Brain/diagnostic imaging , Neuroimaging , Phenotype
5.
Trends Genet ; 39(11): 810-812, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37596117

ABSTRACT

Twin and genomic studies indicate that genes play an important role in the development of cognitive ability. However, data limitations have made it difficult to pinpoint specific genes with a large impact. By examining the full gene sequences of >300 000 individuals, Chen et al. find eight such genes.

6.
J Clin Endocrinol Metab ; 108(12): e1580-e1587, 2023 Nov 17.
Article in English | MEDLINE | ID: mdl-37339320

ABSTRACT

CONTEXT: The melanocortin 3 receptor (MC3R) has recently emerged as a critical regulator of pubertal timing, linear growth, and the acquisition of lean mass in humans and mice. In population-based studies, heterozygous carriers of deleterious variants in MC3R report a later onset of puberty than noncarriers. However, the frequency of such variants in patients who present with clinical disorders of pubertal development is currently unknown. OBJECTIVE: This work aimed to determine whether deleterious MC3R variants are more frequently found in patients clinically presenting with constitutional delay of growth and puberty (CDGP) or normosmic idiopathic hypogonadotropic hypogonadism (nIHH). METHODS: We examined the sequence of MC3R in 362 adolescents with a clinical diagnosis of CDGP and 657 patients with nIHH, experimentally characterized the signaling properties of all nonsynonymous variants found and compared their frequency to that in 5774 controls from a population-based cohort. Additionally, we established the relative frequency of predicted deleterious variants in individuals with self-reported delayed vs normally timed menarche/voice-breaking in the UK Biobank cohort. RESULTS: MC3R loss-of-function variants were infrequent but overrepresented in patients with CDGP (8/362 [2.2%]; OR = 4.17; P = .001). There was no strong evidence of overrepresentation in patients with nIHH (4/657 [0.6%]; OR = 1.15; P = .779). In 246 328 women from the UK Biobank, predicted deleterious variants were more frequently found in those self-reporting delayed (aged ≥16 years) vs normal age at menarche (OR = 1.66; P = 3.90E-07). CONCLUSION: We have found evidence that functionally damaging variants in MC3R are overrepresented in individuals with CDGP but are not a common cause of this phenotype.


Subject(s)
Hypogonadism , Puberty, Delayed , Adolescent , Humans , Female , Animals , Mice , Receptor, Melanocortin, Type 3 , Prevalence , Hypogonadism/epidemiology , Hypogonadism/genetics , Hypogonadism/complications , Puberty, Delayed/epidemiology , Puberty, Delayed/genetics , Puberty, Delayed/diagnosis , Puberty/genetics , Growth Disorders/genetics
8.
Nat Med ; 29(6): 1540-1549, 2023 Jun.
Article in English | MEDLINE | ID: mdl-37248299

ABSTRACT

Preeclampsia and gestational hypertension are common pregnancy complications associated with adverse maternal and child outcomes. Current tools for prediction, prevention and treatment are limited. Here we tested the association of maternal DNA sequence variants with preeclampsia in 20,064 cases and 703,117 control individuals and with gestational hypertension in 11,027 cases and 412,788 control individuals across discovery and follow-up cohorts using multi-ancestry meta-analysis. Altogether, we identified 18 independent loci associated with preeclampsia/eclampsia and/or gestational hypertension, 12 of which are new (for example, MTHFR-CLCN6, WNT3A, NPR3, PGR and RGL3), including two loci (PLCE1 and FURIN) identified in the multitrait analysis. Identified loci highlight the role of natriuretic peptide signaling, angiogenesis, renal glomerular function, trophoblast development and immune dysregulation. We derived genome-wide polygenic risk scores that predicted preeclampsia/eclampsia and gestational hypertension in external cohorts, independent of clinical risk factors, and reclassified eligibility for low-dose aspirin to prevent preeclampsia. Collectively, these findings provide mechanistic insights into the hypertensive disorders of pregnancy and have the potential to advance pregnancy risk stratification.


Subject(s)
Eclampsia , Hypertension, Pregnancy-Induced , Hypertension , Pre-Eclampsia , Pregnancy , Female , Child , Humans , Hypertension, Pregnancy-Induced/genetics , Pre-Eclampsia/genetics , Pre-Eclampsia/prevention & control , Aspirin , Risk Factors
9.
N Engl J Med ; 388(17): 1559-1571, 2023 Apr 27.
Article in English | MEDLINE | ID: mdl-37043637

ABSTRACT

BACKGROUND: Pediatric disorders include a range of highly penetrant, genetically heterogeneous conditions amenable to genomewide diagnostic approaches. Finding a molecular diagnosis is challenging but can have profound lifelong benefits. METHODS: We conducted a large-scale sequencing study involving more than 13,500 families with probands with severe, probably monogenic, difficult-to-diagnose developmental disorders from 24 regional genetics services in the United Kingdom and Ireland. Standardized phenotypic data were collected, and exome sequencing and microarray analyses were performed to investigate novel genetic causes. We developed an iterative variant analysis pipeline and reported candidate variants to clinical teams for validation and diagnostic interpretation to inform communication with families. Multiple regression analyses were performed to evaluate factors affecting the probability of diagnosis. RESULTS: A total of 13,449 probands were included in the analyses. On average, we reported 1.0 candidate variant per parent-offspring trio and 2.5 variants per singleton proband. Using clinical and computational approaches to variant classification, we made a diagnosis in approximately 41% of probands (5502 of 13,449). Of 3599 probands in trios who received a diagnosis by clinical assertion, approximately 76% had a pathogenic de novo variant. Another 22% of probands (2997 of 13,449) had variants of uncertain significance in genes that were strongly linked to monogenic developmental disorders. Recruitment in a parent-offspring trio had the largest effect on the probability of diagnosis (odds ratio, 4.70; 95% confidence interval [CI], 4.16 to 5.31). Probands were less likely to receive a diagnosis if they were born extremely prematurely (i.e., 22 to 27 weeks' gestation; odds ratio, 0.39; 95% CI, 0.22 to 0.68), had in utero exposure to antiepileptic medications (odds ratio, 0.44; 95% CI, 0.29 to 0.67), had mothers with diabetes (odds ratio, 0.52; 95% CI, 0.41 to 0.67), or were of African ancestry (odds ratio, 0.51; 95% CI, 0.31 to 0.78). CONCLUSIONS: Among probands with severe, probably monogenic, difficult-to-diagnose developmental disorders, multimodal analysis of genomewide data had good diagnostic power, even after previous attempts at diagnosis. (Funded by the Health Innovation Challenge Fund and Wellcome Sanger Institute.)


Subject(s)
Genomics , Rare Diseases , Child , Humans , Exome , Ireland/epidemiology , United Kingdom/epidemiology , Rare Diseases/diagnosis , Rare Diseases/epidemiology , Rare Diseases/genetics , Oligonucleotide Array Sequence Analysis , Genetic Association Studies , Neurodevelopmental Disorders/diagnosis , Neurodevelopmental Disorders/genetics , Congenital Abnormalities/diagnosis , Congenital Abnormalities/genetics , Growth Disorders/diagnosis , Growth Disorders/genetics , Facies , Child Behavior Disorders/diagnosis , Child Behavior Disorders/genetics , Genetic Diseases, Inborn/diagnosis , Genetic Diseases, Inborn/genetics
10.
Genome Biol ; 23(1): 268, 2022 12 27.
Article in English | MEDLINE | ID: mdl-36575460

ABSTRACT

BACKGROUND: Genetic variants within nearly 1000 loci are known to contribute to modulation of blood lipid levels. However, the biological pathways underlying these associations are frequently unknown, limiting understanding of these findings and hindering downstream translational efforts such as drug target discovery. RESULTS: To expand our understanding of the underlying biological pathways and mechanisms controlling blood lipid levels, we leverage a large multi-ancestry meta-analysis (N = 1,654,960) of blood lipids to prioritize putative causal genes for 2286 lipid associations using six gene prediction approaches. Using phenome-wide association (PheWAS) scans, we identify relationships of genetically predicted lipid levels to other diseases and conditions. We confirm known pleiotropic associations with cardiovascular phenotypes and determine novel associations, notably with cholelithiasis risk. We perform sex-stratified GWAS meta-analysis of lipid levels and show that 3-5% of autosomal lipid-associated loci demonstrate sex-biased effects. Finally, we report 21 novel lipid loci identified on the X chromosome. Many of the sex-biased autosomal and X chromosome lipid loci show pleiotropic associations with sex hormones, emphasizing the role of hormone regulation in lipid metabolism. CONCLUSIONS: Taken together, our findings provide insights into the biological mechanisms through which associated variants lead to altered lipid levels and potentially cardiovascular disease risk.


Subject(s)
Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Sex Characteristics , Phenotype , Lipids/genetics , Polymorphism, Single Nucleotide , Genetic Pleiotropy
11.
Nat Commun ; 13(1): 4664, 2022 08 09.
Article in English | MEDLINE | ID: mdl-35945198

ABSTRACT

Individuals with South Asian ancestry have a higher risk of heart disease than other groups but have been largely excluded from genetic research. Using data from 22,000 British Pakistani and Bangladeshi individuals with linked electronic health records from the Genes & Health cohort, we conducted genome-wide association studies of coronary artery disease and its key risk factors. Using power-adjusted transferability ratios, we found evidence for transferability for the majority of cardiometabolic loci powered to replicate. The performance of polygenic scores was high for lipids and blood pressure, but lower for BMI and coronary artery disease. Adding a polygenic score for coronary artery disease to clinical risk factors showed significant improvement in reclassification. In Mendelian randomisation using transferable loci as instruments, our findings were consistent with results in European-ancestry individuals. Taken together, trait-specific transferability of trait loci between populations is an important consideration with implications for risk prediction and causal inference.


Subject(s)
Coronary Artery Disease , Genome-Wide Association Study , Asian People/genetics , Coronary Artery Disease/epidemiology , Coronary Artery Disease/genetics , Genetic Loci , Humans , Pakistan , Polymorphism, Single Nucleotide
12.
Am J Hum Genet ; 109(8): 1366-1387, 2022 08 04.
Article in English | MEDLINE | ID: mdl-35931049

ABSTRACT

A major challenge of genome-wide association studies (GWASs) is to translate phenotypic associations into biological insights. Here, we integrate a large GWAS on blood lipids involving 1.6 million individuals from five ancestries with a wide array of functional genomic datasets to discover regulatory mechanisms underlying lipid associations. We first prioritize lipid-associated genes with expression quantitative trait locus (eQTL) colocalizations and then add chromatin interaction data to narrow the search for functional genes. Polygenic enrichment analysis across 697 annotations from a host of tissues and cell types confirms the central role of the liver in lipid levels and highlights the selective enrichment of adipose-specific chromatin marks in high-density lipoprotein cholesterol and triglycerides. Overlapping transcription factor (TF) binding sites with lipid-associated loci identifies TFs relevant in lipid biology. In addition, we present an integrative framework to prioritize causal variants at GWAS loci, producing a comprehensive list of candidate causal genes and variants with multiple layers of functional evidence. We highlight two of the prioritized genes, CREBRF and RRBP1, which show convergent evidence across functional datasets supporting their roles in lipid biology.


Subject(s)
Genome-Wide Association Study , Polymorphism, Single Nucleotide , Chromatin/genetics , Genomics , Humans , Lipids/genetics , Polymorphism, Single Nucleotide/genetics
13.
Genome Med ; 14(1): 73, 2022 07 19.
Article in English | MEDLINE | ID: mdl-35850704

ABSTRACT

BACKGROUND: The majority of clinical genetic testing focuses almost exclusively on regions of the genome that directly encode proteins. The important role of variants in non-coding regions in penetrant disease is, however, increasingly being demonstrated, and the use of whole genome sequencing in clinical diagnostic settings is rising across a large range of genetic disorders. Despite this, there is no existing guidance on how current guidelines designed primarily for variants in protein-coding regions should be adapted for variants identified in other genomic contexts. METHODS: We convened a panel of nine clinical and research scientists with wide-ranging expertise in clinical variant interpretation, with specific experience in variants within non-coding regions. This panel discussed and refined an initial draft of the guidelines which were then extensively tested and reviewed by external groups. RESULTS: We discuss considerations specifically for variants in non-coding regions of the genome. We outline how to define candidate regulatory elements, highlight examples of mechanisms through which non-coding region variants can lead to penetrant monogenic disease, and outline how existing guidelines can be adapted for the interpretation of these variants. CONCLUSIONS: These recommendations aim to increase the number and range of non-coding region variants that can be clinically interpreted, which, together with a compatible phenotype, can lead to new diagnoses and catalyse the discovery of novel disease mechanisms.


Subject(s)
Genetic Variation , Genome-Wide Association Study , Genome , Open Reading Frames , Regulatory Sequences, Nucleic Acid
14.
Nat Genet ; 54(9): 1293-1304, 2022 09.
Article in English | MEDLINE | ID: mdl-35654973

ABSTRACT

The substantial phenotypic heterogeneity in autism limits our understanding of its genetic etiology. To address this gap, here we investigated genetic differences between autistic individuals (n_max = 12,893) based on core and associated features of autism, co-occurring developmental disabilities and sex. We conducted a comprehensive factor analysis of core autism features in autistic individuals and identified six factors. Common genetic variants were associated with the core factors, but de novo variants were not. We found that higher autism polygenic scores (PGS) were associated with lower likelihood of co-occurring developmental disabilities in autistic individuals. Furthermore, in autistic individuals without co-occurring intellectual disability (ID), autism PGS are overinherited by autistic females compared to males. Finally, we observed higher SNP heritability for autistic males and for autistic individuals without ID. Deeper phenotypic characterization will be critical in determining how the complex underlying genetics shape cognition, behavior and co-occurring conditions in autism.


Subject(s)
Autism Spectrum Disorder , Autistic Disorder , Intellectual Disability , Autism Spectrum Disorder/genetics , Autistic Disorder/genetics , Cognition , Female , Humans , Intellectual Disability/genetics , Male
15.
PLoS Med ; 19(5): e1003981, 2022 05.
Article in English | MEDLINE | ID: mdl-35587468

ABSTRACT

BACKGROUND: Type 2 diabetes (T2D) is highly prevalent in British South Asians, yet they are underrepresented in research. Genes & Health (G&H) is a large, population study of British Pakistanis and Bangladeshis (BPB) comprising genomic and routine health data. We assessed the extent to which genetic risk for T2D is shared between BPB and European populations (EUR). We then investigated whether the integration of a polygenic risk score (PRS) for T2D with an existing risk tool (QDiabetes) could improve prediction of incident disease and the characterisation of disease subtypes. METHODS AND FINDINGS: In this observational cohort study, we assessed whether common genetic loci associated with T2D in EUR individuals were replicated in 22,490 BPB individuals in G&H. We replicated fewer loci in G&H (n = 76/338, 22%) than would be expected given power if all EUR-ascertained loci were transferable (n = 101, 30%; p = 0.001). Of the 27 transferable loci that were powered to interrogate this, only 9 showed evidence of shared causal variants. We constructed a T2D PRS and combined it with a clinical risk instrument (QDiabetes) in a novel, integrated risk tool (IRT) to assess risk of incident diabetes. To assess model performance, we compared categorical net reclassification index (NRI) versus QDiabetes alone. In 13,648 patients free from T2D followed up for 10 years, NRI was 3.2% for IRT versus QDiabetes (95% confidence interval (CI): 2.0% to 4.4%). IRT performed best in reclassification of individuals aged less than 40 years deemed low risk by QDiabetes alone (NRI 5.6%, 95% CI 3.6% to 7.6%), who tended to be free from comorbidities and slim. After adjustment for QDiabetes score, PRS was independently associated with progression to T2D after gestational diabetes (hazard ratio (HR) per SD of PRS 1.23, 95% CI 1.05 to 1.42, p = 0.028). Using cluster analysis of clinical features at diabetes diagnosis, we replicated previously reported disease subgroups, including Mild Age-Related, Mild Obesity-related, and Insulin-Resistant Diabetes, and showed that PRS distribution differs between subgroups (p = 0.002). Integrating PRS in this cluster analysis revealed a Probable Severe Insulin Deficient Diabetes (pSIDD) subgroup, despite the absence of clinical measures of insulin secretion or resistance. We also observed differences in rates of progression to micro- and macrovascular complications between subgroups after adjustment for confounders. Study limitations include the absence of an external replication cohort and the potential biases arising from missing or incorrect routine health data. CONCLUSIONS: Our analysis of the transferability of T2D loci between EUR and BPB indicates the need for larger, multiancestry studies to better characterise the genetic contribution to disease and its varied aetiology. We show that a T2D PRS optimised for this high-risk BPB population has potential clinical application in BPB, improving the identification of T2D risk (especially in the young) on top of an established clinical risk algorithm and aiding identification of subgroups at diagnosis, which may help future efforts to stratify care and treatment of the disease.
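The categorical net reclassification index (NRI) reported in this abstract can be sketched as follows, assuming the standard definition: the net proportion of people who developed the outcome and were moved to a higher risk category, minus the net proportion of people who did not develop it and were moved to a higher category. The function name and the integer encoding of risk categories are hypothetical; the abstract does not give the category boundaries the authors used.

```python
from typing import List, Tuple

# Each pair is (category under the old model, category under the new model),
# encoded as integers where a larger number means higher predicted risk.
Pairs = List[Tuple[int, int]]


def net_upward(pairs: Pairs) -> float:
    """Proportion reclassified upward minus proportion reclassified downward."""
    up = sum(1 for old, new in pairs if new > old)
    down = sum(1 for old, new in pairs if new < old)
    return (up - down) / len(pairs)


def categorical_nri(events: Pairs, nonevents: Pairs) -> float:
    """Categorical NRI: net upward movement in events minus that in non-events."""
    return net_upward(events) - net_upward(nonevents)
```

A positive value means the new model tends to move people who go on to develop the disease into higher categories and people who do not into lower ones; both input lists must be non-empty.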


Subject(s)
Diabetes Mellitus, Type 2 , Asian People , Cohort Studies , Diabetes Mellitus, Type 2/diagnosis , Diabetes Mellitus, Type 2/epidemiology , Diabetes Mellitus, Type 2/genetics , Female , Humans , Insulin , Pakistan/epidemiology , Risk Factors
16.
Nature ; 603(7903): 858-863, 2022 03.
Article in English | MEDLINE | ID: mdl-35322230

ABSTRACT

Genome-wide sequencing of human populations has revealed substantial variation among genes in the intensity of purifying selection acting on damaging genetic variants [1]. Although genes under the strongest selective constraint are highly enriched for associations with Mendelian disorders, most of these genes are not associated with disease and therefore the nature of the selection acting on them is not known [2]. Here we show that genetic variants that damage these genes are associated with markedly reduced reproductive success, primarily owing to increased childlessness, with a stronger effect in males than in females. We present evidence that increased childlessness is probably mediated by genetically associated cognitive and behavioural traits, which may mean that male carriers are less likely to find reproductive partners. This reduction in reproductive success may account for 20% of purifying selection against heterozygous variants that ablate protein-coding genes. Although this genetic association may only account for a very minor fraction of the overall likelihood of being childless (less than 1%), especially when compared to more influential sociodemographic factors, it may influence how genes evolve over time.


Subject(s)
Reproduction , Selection, Genetic , Chromosome Mapping , Female , Heterozygote , Humans , Male , Phenotype , Reproduction/genetics
17.
Nature ; 600(7890): 675-679, 2021 12.
Article in English | MEDLINE | ID: mdl-34887591

ABSTRACT

Increased blood lipid levels are heritable risk factors of cardiovascular disease with varied prevalence worldwide owing to different dietary patterns and medication use [1]. Despite advances in prevention and treatment, in particular through reducing low-density lipoprotein cholesterol levels [2], heart disease remains the leading cause of death worldwide [3]. Genome-wide association studies (GWAS) of blood lipid levels have led to important biological and clinical insights, as well as new drug targets, for cardiovascular disease. However, most previous GWAS [4-23] have been conducted in European ancestry populations and may have missed genetic variants that contribute to lipid-level variation in other ancestry groups. These include differences in allele frequencies, effect sizes and linkage-disequilibrium patterns [24]. Here we conduct a multi-ancestry, genome-wide genetic discovery meta-analysis of lipid levels in approximately 1.65 million individuals, including 350,000 of non-European ancestries. We quantify the gain in studying non-European ancestries and provide evidence to support the expansion of recruitment of additional ancestries, even with relatively small sample sizes. We find that increasing diversity rather than studying additional individuals of European ancestry results in substantial improvements in fine-mapping functional variants and portability of polygenic prediction (evaluated in approximately 295,000 individuals from 7 ancestry groupings). Modest gains in the number of discovered loci and ancestry-specific variants were also achieved. As GWAS expand emphasis beyond the identification of genes and fundamental biology towards the use of genetic variants for preventive and precision medicine [25], we anticipate that increased diversity of participants will lead to more accurate and equitable [26] application of polygenic scores in clinical practice.


Subject(s)
Cardiovascular Diseases , Genome-Wide Association Study , Cardiovascular Diseases/genetics , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study/methods , Humans , Linkage Disequilibrium , Multifactorial Inheritance , Polymorphism, Single Nucleotide/genetics , Population Groups
18.
Nat Commun ; 12(1): 7189, 2021 12 10.
Article in English | MEDLINE | ID: mdl-34893604

ABSTRACT

Previous genetic and public health research in the Pakistani population has focused on the role of consanguinity in increasing recessive disease risk, but little is known about its recent population history or the effects of endogamy. Here, we investigate fine-scale population structure, history and consanguinity patterns using genotype chip data from 2,200 British Pakistanis. We reveal strong recent population structure driven by the biraderi social stratification system. We find that all subgroups have had low recent effective population sizes (Ne), with some showing a decrease 15‒20 generations ago that has resulted in extensive identity-by-descent sharing and homozygosity, increasing the risk of recessive disorders. Our results from two orthogonal methods (one using machine learning and the other coalescent-based) suggest that the detailed reporting of parental relatedness for mothers in the cohort under-represents the true levels of consanguinity. These results demonstrate the impact of cultural practices on population structure and genomic diversity in Pakistanis, and have important implications for medical genetic studies.


Subject(s)
Asian People/genetics , Consanguinity , Genetics, Population , White People/genetics , Cohort Studies , Demography , Genotype , Homozygote , Humans , Marriage , Models, Genetic , Pakistan , Parents , Population Density , Social Status
19.
Am J Hum Genet ; 108(11): 2186-2194, 2021 11 04.
Article in English | MEDLINE | ID: mdl-34626536

ABSTRACT

Structural variation (SV) describes a broad class of genetic variation greater than 50 bp in size. SVs can cause a wide range of genetic diseases and are prevalent in rare developmental disorders (DDs). Individuals presenting with DDs are often referred for diagnostic testing with chromosomal microarrays (CMAs) to identify large copy-number variants (CNVs) and/or with single-gene, gene-panel, or exome sequencing (ES) to identify single-nucleotide variants, small insertions/deletions, and CNVs. However, individuals with pathogenic SVs undetectable by conventional analysis often remain undiagnosed. Consequently, we have developed the tool InDelible, which interrogates short-read sequencing data for split-read clusters characteristic of SV breakpoints. We applied InDelible to 13,438 probands with severe DDs recruited as part of the Deciphering Developmental Disorders (DDD) study and discovered 63 rare, damaging variants in genes previously associated with DDs missed by standard SNV, indel, or CNV discovery approaches. Clinical review of these 63 variants determined that about half (30/63) were plausibly pathogenic. InDelible was particularly effective at ascertaining variants between 21 and 500 bp in size and increased the total number of potentially pathogenic variants identified by DDD in this size range by 42.9%. Of particular interest were seven confirmed de novo variants in MECP2, which represent 35.0% of all de novo protein-truncating variants in MECP2 among DDD study participants. InDelible provides a framework for the discovery of pathogenic SVs that are most likely missed by standard analytical workflows and has the potential to improve the diagnostic yield of ES across a broad range of genetic diseases.


Subject(s)
Developmental Disabilities/diagnosis , Developmental Disabilities/genetics , Exome Sequencing/methods , Child , Female , Humans , Male , Methyl-CpG-Binding Protein 2/genetics
20.
Cell ; 184(18): 4612-4625.e14, 2021 09 02.
Article in English | MEDLINE | ID: mdl-34352227

ABSTRACT

The Middle East region is important to understand human evolution and migrations but is underrepresented in genomic studies. Here, we generated 137 high-coverage physically phased genome sequences from eight Middle Eastern populations using linked-read sequencing. We found no genetic traces of early expansions out-of-Africa in present-day populations but found Arabians have elevated Basal Eurasian ancestry that dilutes their Neanderthal ancestry. Population sizes within the region started diverging 15-20 kya, when Levantines expanded while Arabians maintained smaller populations that derived ancestry from local hunter-gatherers. Arabians suffered a population bottleneck around the aridification of Arabia 6 kya, while Levantines had a distinct bottleneck overlapping the 4.2 kya aridification event. We found an association between movement and admixture of populations in the region and the spread of Semitic languages. Finally, we identify variants that show evidence of selection, including polygenic selection. Our results provide detailed insights into the genomic and selective histories of the Middle East.


Subject(s)
Genetics, Population/history , Genome, Human , Animals , Chromosomes, Human, Y/genetics , Databases, Genetic , Gene Pool , Genetic Introgression , Geography , History, Ancient , Human Migration , Humans , Middle East , Models, Genetic , Neanderthals/genetics , Phylogeny , Population Density , Selection, Genetic , Sequence Analysis, DNA